1. Introduction

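A very classical problem in dynamics consists in studying the Hamiltonian system on the symplectic manifold $T^*\mathbb{T}^n = \mathbb{T}^n \times \mathbb{R}^n$ generated by the Hamiltonian

$$H_\epsilon : \mathbb{T} \times T^*\mathbb{T}^n = \mathbb{T} \times \mathbb{T}^n \times \mathbb{R}^n \longrightarrow \mathbb{R}, \qquad (t,x,y) \longmapsto \tfrac12\|y\|^2 - \epsilon G(t,x,y) \tag{1}$$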



where $\epsilon$ is a small perturbation parameter. More general unperturbed systems $h(y)$ can be considered instead of $\|y\|^2/2$, but we restrict to that particular case in the present paper in order to simplify some notations. For $\epsilon = 0$, the system is integrable, and the momenta $y$ are integrals of motion. For $\epsilon > 0$, these variables undergo small oscillations. KAM theory implies that these oscillations remain permanently bounded for many initial conditions. For other initial conditions, a large evolution might be possible. By Nekhoroshev theory, it must be extremely slow. The question we discuss in the present text is whether this large evolution is actually possible, and to what geometric structures it is associated.

*Member of the IUF

Let us consider a resonant momentum $y_0 = (I_0, 0) \in \mathbb{R}^m \times \mathbb{R}^r = \mathbb{R}^n$, and assume that $I_0$ is not resonant, which means that $k \cdot I_0$ never belongs to $\mathbb{Z}$ for $k \in \mathbb{Z}^m$, $k \neq 0$. In order to study the dynamics near the torus $\{y = y_0\}$, it is useful to introduce the notations $x = (\theta, q) \in \mathbb{T}^m \times \mathbb{T}^r$ and $y = (I, p) \in \mathbb{R}^m \times \mathbb{R}^r$, $m + r = n$. In the neighborhood of the torus $\{y = y_0\}$, the dynamics is approximated by the averaged system

$$\tfrac12\|p\|^2 + \tfrac12\|I\|^2 - \epsilon V(q),$$

where

$$V(q) = \int_{\mathbb{T} \times \mathbb{T}^m} G(t,\theta,q,y_0)\, d\theta\, dt.$$

Following a classical idea of Poincaré and Arnold, we can try to exploit this observation by considering the system

$$H(t,\theta,q,I,p) = \tfrac12\|p\|^2 + \tfrac12\|I\|^2 - \epsilon V(q) - \mu R(t,\theta,q,I,p) \tag{2}$$

with a second perturbation parameter $\mu$ independent from $\epsilon$. We assume that $V$ has a unique non-degenerate minimum, say at $q = 0$. Fixing $\epsilon > 0$, we can study this system for $\mu > 0$ small enough, which is a simpler problem and may give some hints about the dynamics of (1). The reason why instability is more easily proved in (2) than in (1) is the presence of the hyperbolic fixed point at $(0, 0)$ of the $(q,p)$ component of the averaged system. Studying (2) for $\mu > 0$ small enough is thus called the a priori unstable problem, or the a priori hyperbolic problem. In contrast, the Hamiltonian (1) is called a priori stable. The a priori unstable case is by now quite well understood for $m = 1$, see [46, 17, 18, 26, 5, 48] for example. The a priori unstable case for $m > 1$ and the a priori stable case can be considered as widely open, in spite of the important announcements of John Mather in [i']. The starting point in the study of (2) is the famous paper of Arnold, [ ]. In this paper, Arnold introduced a particular a priori unstable system where some geometric structures associated to diffusion, namely partially hyperbolic tori (that he called whiskered), their stable and unstable manifolds, and heteroclinic connections, can be almost explicitly described. This geometric structure has been called a transition chain. Most of the subsequent works on the a priori unstable problem have consisted in trying to find transition chains in more general cases, but understanding the general a priori unstable Hamiltonian has required a change of paradigm: from partially hyperbolic tori to normally hyperbolic cylinders. The variational methods introduced by John Mather in [' ] and Ugo Bessi in [ ] have also been very influential.

Transferring the understanding gained on the dynamics of (2) to information on the a priori stable case is not an easy task. Since we understand the system (2) when $m = 1$, the first attempt should be to study (1) in the neighborhood of an $(n-1)$-resonant line, for example the line consisting of momenta of the form $y = (I, 0)$, $I \in \mathbb{R}$. We could hope to prove the existence of drift along such a line by using the a priori unstable approximations near each value of $y$. However, we face the problem that an approximation like (2) holds only in the neighborhood of the torus $\{y = (I, 0)\}$ when the frequency $I \in \mathbb{R}$ is irrational. Near the torus $\{y = (I, 0)\}$ with $I$ rational, one should use an approximation of the form

$$H(t,x,y) = \tfrac12\|y\|^2 - \epsilon W(x) - \mu R(t,x,y)$$

and different methods must be used. This is often called the problem of double resonances when $n = 2$. We will call it the problem of maximal resonances.

Our general goal in this paper is to study a priori unstable systems with sufficient generality to be able to gain information on the a priori stable case. We start with a relatively detailed description of Arnold's example in Section 2, which is also an occasion to settle some notations and introduce some important objects, like the partially hyperbolic tori, their stable and unstable manifolds, and the associated generating functions. Working with these generating functions allows us to highlight the connections between the various classical approaches: geometric methods, variational methods, and weak KAM theory. Then, from the end of Section 2 to Section 3, we progressively generalize the setting and indicate how the methods introduced on the example of Arnold can be improved to face the new difficulties that occur. We present the Large Gap Problem, which prevents Arnold's mechanism from being directly applied to general a priori unstable systems, and explain how the presence of a normally hyperbolic cylinder can be used to solve this problem and prove instability in general a priori unstable systems. In Section 4 we give a new result from [(>], on the existence of normally hyperbolic cylinders in the a priori stable situation, which should make it possible to apply the tools exposed in the previous sections to a priori stable systems. This suggests a possible strategy to prove the following conjecture:

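Conjecture 1. For a typical perturbation $G$, there exist two positive numbers $\epsilon_0$ and $\delta$ such that, for each $\epsilon \in ]0, \epsilon_0[$, the system (1) has an orbit

$$(\theta(t), q(t), \dot\theta(t), \dot q(t)) : \mathbb{R} \longrightarrow \mathbb{T} \times \mathbb{T}^{n-1} \times \mathbb{R} \times \mathbb{R}^{n-1}$$

such that $\sup_t \dot\theta(t) - \inf_t \dot\theta(t) > \delta$.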

We are currently working on this program in collaboration with Vadim Kaloshin and Ke Zhang. The same conjecture can be stated with a more general unperturbed system $h(y)$, and the same proof should work provided $h$ is convex and smooth. Our strategy of proof does not consist in solving the maximal resonance problem, but rather in observing that the conjectured statement can be reached without solving that difficulty. In that respect, what we expose is much easier than the project of Mather as announced in [42]. The result is weaker since only limited diffusion is obtained. The maximal resonance problem has to be solved in order to prove the existence of global diffusion on a whole resonant line, or from one resonant line to another. Our strategy, on the other hand, has the advantage of working for all $n \geq 2$, while Mather is limited to $n = 2$ at the moment.



2. The example of Arnold and some extensions 

Following Arnold [1], we consider the Hamiltonian

$$H(t,\theta,q,I,p) = \tfrac12\|p\|^2 + \tfrac12\|I\|^2 + \epsilon(\cos(2\pi q) - 1)(1 + \mu f(t,\theta,q)). \tag{3}$$

We will often use the corresponding Lagrangian

$$L(t,\theta,q,\dot\theta,\dot q) = \tfrac12\|\dot q\|^2 + \tfrac12\|\dot\theta\|^2 + \epsilon(1 - \cos(2\pi q))(1 + \mu f(t,\theta,q)).$$
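As an illustration only, the equations of motion generated by (3) can be written down and integrated numerically. The sketch below assumes Arnold's perturbation $f = \cos(2\pi\theta) + \cos(2\pi t)$, which is the one used in Theorem 2; the function names `vector_field` and `integrate` and the fixed-step Runge-Kutta scheme are choices made for this example and are not taken from the text.

```python
import math

TWO_PI = 2.0 * math.pi


def f(t, theta):
    # Assumed perturbation (Arnold's choice); it does not depend on q.
    return math.cos(TWO_PI * theta) + math.cos(TWO_PI * t)


def vector_field(t, state, eps, mu):
    """Hamilton's equations for H = p^2/2 + I^2/2 + eps*(cos(2 pi q) - 1)*(1 + mu*f)."""
    theta, q, I, p = state
    c = math.cos(TWO_PI * q) - 1.0
    d_theta = I                                                   # dH/dI
    d_q = p                                                       # dH/dp
    d_I = TWO_PI * eps * mu * c * math.sin(TWO_PI * theta)        # -dH/dtheta
    d_p = TWO_PI * eps * math.sin(TWO_PI * q) * (1.0 + mu * f(t, theta))  # -dH/dq
    return (d_theta, d_q, d_I, d_p)


def integrate(state, t0, t1, eps, mu, steps=2000):
    """Classical fixed-step RK4 scheme; returns the state at time t1."""
    h = (t1 - t0) / steps
    t = t0
    for _ in range(steps):
        k1 = vector_field(t, state, eps, mu)
        k2 = vector_field(t + h / 2, tuple(s + h / 2 * k for s, k in zip(state, k1)), eps, mu)
        k3 = vector_field(t + h / 2, tuple(s + h / 2 * k for s, k in zip(state, k2)), eps, mu)
        k4 = vector_field(t + h, tuple(s + h * k for s, k in zip(state, k3)), eps, mu)
        state = tuple(s + h / 6 * (a + 2 * b + 2 * c + d)
                      for s, a, b, c, d in zip(state, k1, k2, k3, k4))
        t += h
    return state
```

With $\mu = 0$ the third component of the state stays constant, and for any $\mu$ an initial condition with $q = p = 0$ stays on the torus $\{q = 0, p = 0, I = a\}$, which are the two structural facts used in the sequel.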

We will see $\epsilon > 0$ as a fixed parameter, and discuss mainly the small parameter $\mu$. When $\mu = 0$, the variable $I$ is an integral of motion. Our goal is to study its evolution for $\mu > 0$. The form of the perturbation is chosen in such a way that the two-dimensional tori

$$\mathcal{T}(a) = \mathbb{T}^2 \times \{0\} \times \{a\} \times \{0\}, \qquad a \in \mathbb{R},$$

are invariant in the extended phase space, and carry a linear motion of frequency $(1,a)$. By studying invariant manifolds attached to these invariant tori, Arnold discovered a remarkable diffusion mechanism, now called the Arnold mechanism, that we are now going to describe. In the case $\mu = 0$, the tori $\mathcal{T}(a)$ appear as the products of the hyperbolic fixed point $(0, 0)$ of the pendulum in $(q,p)$ by the invariant torus $\mathbb{T} \times \mathbb{T} \times \{a\}$ of the integrable system in the $(t, \theta, I)$ space. They are thus partially hyperbolic, and have stable and unstable manifolds, which coincide and can be given explicitly as

$$\mathcal{W}(a) = \{ (t, \theta, q, a, \pm\partial_q S_0(q)) : (t, \theta, q) \in \mathbb{T}^3 \}$$

with

$$S_0(q) = \frac{2\sqrt{\epsilon}}{\pi}(1 - \cos(\pi q)). \tag{4}$$
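The normalization of $S_0$ can be checked directly: the graph of $\pm\partial_q S_0$ must lie in the zero energy level of the pendulum $p^2/2 + \epsilon(\cos(2\pi q) - 1)$, which forces $\partial_q S_0(q) = 2\sqrt{\epsilon}\sin(\pi q)$. The following sketch, under the assumption that (4) reads $S_0(q) = \frac{2\sqrt{\epsilon}}{\pi}(1 - \cos(\pi q))$, evaluates this relation numerically; the helper names are hypothetical.

```python
import math


def S0(q, eps):
    """Generating function (4) of the unperturbed separatrix (assumed normalization)."""
    return 2.0 * math.sqrt(eps) / math.pi * (1.0 - math.cos(math.pi * q))


def dS0(q, eps):
    """Derivative of S0 with respect to q, i.e. the momentum p on the separatrix."""
    return 2.0 * math.sqrt(eps) * math.sin(math.pi * q)


def pendulum_energy(q, p, eps):
    """Pendulum part of (3) for mu = 0: p^2/2 + eps*(cos(2 pi q) - 1)."""
    return 0.5 * p * p + eps * (math.cos(2.0 * math.pi * q) - 1.0)
```

Evaluating `pendulum_energy(q, dS0(q, eps), eps)` and `pendulum_energy(q, -dS0(q, eps), eps)` gives zero up to rounding for every `q`, which is the Hamilton-Jacobi equation of the unperturbed problem.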

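The coincidence and compactness of these stable manifolds is a very special feature of the unperturbed case $\mu = 0$. For $\mu \neq 0$, the tori $\mathcal{T}(a)$ still have stable and unstable manifolds, which can be described as follows: there exist two functions

$$S_a^\pm(t, \theta, q) : \mathbb{T} \times \mathbb{T} \times [-3/4, 3/4] \longrightarrow \mathbb{R}, \tag{5}$$

which converge to $\pm S_0$ when $\mu \to 0$, and such that the graphs

$$\mathcal{W}^\pm(a) = \{ (t, \theta, q \bmod 1, a + \partial_\theta S_a^\pm(t,\theta,q), \partial_q S_a^\pm(t,\theta,q)) \}$$

are pieces of the stable and unstable manifolds of the torus $\mathcal{T}(a)$. More precisely, the set $\mathcal{W}^+(a)$ is negatively invariant under the extended Hamiltonian flow $\varphi^t$, and

$$\mathcal{T}(a) = \bigcap_{t \leq 0} \varphi^t(\mathcal{W}^+(a)),$$

while the set $\mathcal{W}^-(a)$ is positively invariant under the extended Hamiltonian flow, and

$$\mathcal{T}(a) = \bigcap_{t \geq 0} \varphi^t(\mathcal{W}^-(a)).$$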
The functions $S_a^\pm$ solve the Hamilton-Jacobi equation

$$\partial_t S + H(t, \theta, q, a + \partial_\theta S, \partial_q S) = a^2/2,$$

which merely says that the invariant manifolds are contained in the energy level of the torus. The functions $S_a^\pm$ have an expression in terms of the action:

$$S_a^+(t,\theta,q) = \int_{-\infty}^{\tau} L(s,\theta^+(s),q^+(s),\dot\theta^+(s),\dot q^+(s)) - a\dot\theta^+(s) + a^2/2 \, ds \tag{6}$$

$$S_a^-(t,\theta,q) = -\int_{\tau}^{+\infty} L(s,\theta^-(s),q^-(s),\dot\theta^-(s),\dot q^-(s)) - a\dot\theta^-(s) + a^2/2 \, ds,$$

where $\tau$ is any real number such that $\tau \bmod 1 = t$, and $(\theta^\pm(s), q^\pm(s))$ is the solution of the Euler-Lagrange equations such that

$$\theta^\pm(\tau) = \theta, \quad q^\pm(\tau) = q \bmod 1, \quad \dot\theta^\pm(\tau) = a + \partial_\theta S_a^\pm(t,\theta,q), \quad \dot q^\pm(\tau) = \partial_q S_a^\pm(t,\theta,q).$$

Note that the result does not depend on the choice of $\tau$.

2.1. Homoclinic orbits. If $(T, \Theta, Q) \in \mathbb{T} \times \mathbb{T} \times ]1/4, 3/4[$ is a critical point of the function

$$A_a(t,\theta,q) = S_a^+(t,\theta,q) - S_a^-(t,\theta,q-1),$$

then the point

$$(T, \Theta, Q \bmod 1, a + \partial_\theta S_a^+(T,\Theta,Q), \partial_q S_a^+(T,\Theta,Q)) = (T, \Theta, (Q-1) \bmod 1, a + \partial_\theta S_a^-(T,\Theta,Q-1), \partial_q S_a^-(T,\Theta,Q-1))$$

obviously belongs both to $\mathcal{W}^+(a)$ and $\mathcal{W}^-(a)$, hence it is a homoclinic point. It is a transversal homoclinic point if in addition the Hessian of $A_a$ has rank two (it cannot have rank 3 because the intersection is necessarily one-dimensional). It is not obvious at this point that the function $A_a$ necessarily has critical points on the domain $\mathbb{T} \times \mathbb{T} \times ]1/4, 3/4[$. When $\mu$ is small enough, this follows from:

Lemma 2. If $(T, \Theta)$ is a critical point of the function $\Lambda_a : (t, \theta) \longmapsto A_a(t, \theta, 1/2)$, then $(T, \Theta, 1/2)$ is a critical point of $A_a$, hence the manifolds $\mathcal{W}^-(a)$ and $\mathcal{W}^+(a)$ intersect above $(T, \Theta, 1/2) \in \mathbb{T}^3$. This homoclinic point is transversal if and only if the Hessian of $\Lambda_a$ at $(T,\Theta)$ is a non-degenerate $2 \times 2$ matrix.

Note that the function $\Lambda_a$ is defined on $\mathbb{T}^2$, and therefore it has critical points.
Proof. We have $\partial_t S_a^+(T,\Theta,1/2) = \partial_t S_a^-(T,\Theta,-1/2)$; let us denote by $e$ this value. We also have $\partial_\theta S_a^+(T,\Theta,1/2) = \partial_\theta S_a^-(T,\Theta,-1/2)$; we denote by $I$ this value. It is enough to prove that $\partial_q S_a^+(T,\Theta,1/2) = \partial_q S_a^-(T,\Theta,-1/2)$. In order to do so, it is enough to observe that $\partial_q S_a^+(T,\Theta,1/2)$ is the only non-negative solution of the equation

$$e + H(T,\Theta,1/2,a + I, \cdot) = a^2/2,$$

and that precisely the same characterization is true for $\partial_q S_a^-(T,\Theta,-1/2)$. Note that the equation above has two solutions, and that we can discriminate between them because we work in a perturbative setting which gives us rough information on the signs. In more general situations, this is a source of difficulty. □
2.2. Heteroclinic orbits. We have proved the existence of homoclinic orbits. But what is interesting for Arnold diffusion are heteroclinic orbits between different tori. We can deduce the existence of a heteroclinic orbit between $\mathcal{T}(a)$ and $\mathcal{T}(a')$ provided we can find a critical point of the function

$$\mathbb{T} \times \mathbb{R} \times ]1/4,3/4[ \ni (t,\theta,q) \longmapsto S_a^+(t,\theta,q) - S_{a'}^-(t,\theta,q-1) + (a - a')\theta,$$

where we have lifted the functions $S$ without changing their names. As before, we can limit ourselves to finding critical points of the function

$$\Sigma_{a,a'} : \mathbb{T} \times \mathbb{R} \ni (t,\theta) \longmapsto S_a^+(t,\theta,1/2) - S_{a'}^-(t,\theta,-1/2) + (a - a')\theta, \tag{7}$$

but the term $(a - a')\theta$ prevents us from finding them using a global variational method when $a' \neq a$. This reflects the fact that we are studying a non-exact Lagrangian intersection problem. For $\mu = 0$, heteroclinics do not exist. However, recalling that $\Lambda_a(t,\theta) = A_a(t,\theta,1/2)$, we have:

Lemma 3. If the function $\Lambda_a(t,\theta)$ has a non-degenerate critical point, then the functions $\Sigma_{a,a'}$ and $\Sigma_{a',a}$ both have a non-degenerate critical point provided $a'$ is sufficiently close to $a$.

Proof. The theory of partial hyperbolicity implies that the stable and unstable manifolds $\mathcal{W}^\pm(a)$ depend regularly on the parameter $a$. As a consequence, their generating functions $S_a^\pm$ also depend regularly on $a$, and the functions $\Sigma_{a,a'}$ depend regularly on $a$ and $a'$. The result follows since $\Sigma_{a,a} = \Lambda_a$. □
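The role played by the closeness of $a'$ to $a$ can be seen on a one-variable toy model. This model is an assumption made here for illustration and is not the function $\Sigma_{a,a'}$ of the text: take $g(\theta) = c\cos(2\pi\theta) + (a - a')\theta$ with an amplitude $c > 0$. The untilted function has a non-degenerate minimum at $\theta = 1/2$, and this minimum persists exactly as long as $|a - a'| < 2\pi c$. The helper name `tilted_minimum` is hypothetical.

```python
import math


def tilted_minimum(c, tilt):
    """Non-degenerate local minimum of g(theta) = c*cos(2 pi theta) + tilt*theta near 1/2.

    Returns None when |tilt| >= 2 pi c, in which case g has no
    non-degenerate critical point at all.
    """
    x = tilt / (2.0 * math.pi * c)
    if abs(x) >= 1.0:
        return None
    return 0.5 - math.asin(x) / (2.0 * math.pi)


def g_prime(theta, c, tilt):
    """First derivative of g."""
    return -2.0 * math.pi * c * math.sin(2.0 * math.pi * theta) + tilt


def g_second(theta, c, tilt):
    """Second derivative of g (the tilt term is linear and drops out)."""
    return -4.0 * math.pi ** 2 * c * math.cos(2.0 * math.pi * theta)
```

In this toy picture the admissible tilt is proportional to the amplitude $c$, which is the one-dimensional shadow of the fact that the step $|a - a'|$ allowed in a transition chain is controlled by the size of the splitting.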

We say that $a_0, a_1, \ldots, a_k$ is an elementary transition chain if the functions $\Sigma_{a_{i-1},a_i}$ have non-degenerate critical points. We will sometimes use the same terminology for the different requirement that these functions have isolated local minima. From Lemma 3, we deduce:

Proposition 1. Let $\mu$ be given and sufficiently small. Let $[a^-, a^+]$ be an interval such that each of the functions $\Lambda_a$, $a \in [a^-, a^+]$, has a non-degenerate critical point, which means that each of the tori $\mathcal{T}(a)$, $a \in [a^-, a^+]$, has a transversal homoclinic orbit. Then there exists an elementary transition chain $a^- = a_0, a_1, \ldots, a_k = a^+$.

Proof. Let us consider the set $A \subset [a^-, a^+]$ of points that can be reached from $a^-$ by a transition chain. The set $A$ is open: if $a \in A$, then there exists a transition chain $a^- = a_0, a_1, \ldots, a_k = a$ and, by Lemma 3, the sequence $a^- = a_0, a_1, \ldots, a_k, a_{k+1}$ is a transition chain when $a_{k+1}$ is sufficiently close to $a$. The set $A$ is closed: let $a$ be in the closure of $A$. By Lemma 3, the pair $a', a$ is a transition chain when $a'$ is close to $a$. Since $a$ is in the closure of $A$, the point $a'$ can be taken in $A$. Then, there exists a transition chain $a^- = a_0, \ldots, a_k = a'$, and the longer sequence $a^- = a_0, \ldots, a_k, a_{k+1} = a$ is a transition chain between $a_0$ and $a$, hence $a \in A$. Being open, closed and not empty (it contains $a_0$), the set $A$ is equal to $[a^-, a^+]$. □
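The open-closed argument is not constructive as written. Under the extra assumption that a quantitative version of Lemma 3 is available, a chain can be produced by simply marching along the interval. In the sketch below, `radius` is a hypothetical user-supplied function: `radius(a)` is assumed to be a number such that $\Sigma_{a,a'}$ has a non-degenerate critical point whenever $|a' - a| <$ `radius(a)`. No such function is computed in the text; compactness only guarantees that it can be chosen bounded away from zero on $[a^-, a^+]$.

```python
def transition_chain(a_minus, a_plus, radius, max_steps=10**6):
    """Greedy chain a_minus = a_0 < a_1 < ... < a_k = a_plus with a_{i+1} - a_i < radius(a_i)."""
    chain = [a_minus]
    while chain[-1] < a_plus:
        if len(chain) > max_steps:
            raise RuntimeError("radius() does not seem to be bounded away from zero")
        a = chain[-1]
        r = radius(a)
        if r <= 0.0:
            raise ValueError("radius(a) must be positive")
        # Step by half the admissible radius to stay strictly inside the neighborhood.
        chain.append(min(a + 0.5 * r, a_plus))
    return chain
```

The number of steps is at most of the order of $(a^+ - a^-)$ divided by the infimum of `radius`, so small splitting translates into long chains.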
The existence of transition chains implies the existence of diffusion orbits. This is proved by Arnold invoking an "obstruction property". This obstruction property is a characteristic of the local dynamics near the partially hyperbolic tori. It has been proved by Jean-Pierre Marco in [38], see also [21, 31]. The most appealing way to understand the geometric shadowing of transition chains is to use the following statement of Jacky Cresson [22], which can be seen as a strong obstruction property:

Lemma 4. If there exists a transversal heteroclinic between $\mathcal{T}(a)$ and $\mathcal{T}(a')$ and a transversal heteroclinic between $\mathcal{T}(a')$ and $\mathcal{T}(a'')$, then there exists a transversal heteroclinic between $\mathcal{T}(a)$ and $\mathcal{T}(a'')$.

This Lemma implies:

Corollary 5. If $a_0, a_1, \ldots, a_k$ is an elementary transition chain, then there exists a transversal heteroclinic orbit between $\mathcal{T}(a_0)$ and $\mathcal{T}(a_k)$.

Putting everything together, we obtain:

Theorem 1. Let $\mu$ be given and sufficiently small. Let $[a^-, a^+]$ be an interval such that each of the functions $\Lambda_a$, $a \in [a^-, a^+]$, has a non-degenerate critical point. Then there exists a heteroclinic orbit between $\mathcal{T}(a^-)$ and $\mathcal{T}(a^+)$.

2.3. Poincaré-Melnikov approximation. We have constructed diffusion orbits under the assumption that transversal homoclinics exist. We have proved that homoclinic orbits necessarily exist, and one may argue that transversality should hold for typical systems; we will come back to this later. However, it is useful to be able to check whether transversality holds in a given system. A classical approach consists in proving the existence of non-degenerate critical points of the functions $\Sigma_{a,a'}$ of Lemma 3 by expanding them in power series of $\mu$. As a starting point the generating functions $S_a^\pm$ can be expanded as follows:

$$S_a^+(t,\theta,q) = S_0(q) + \mu M^+(t,\theta,q) + O(\mu^2), \tag{8}$$

$$S_a^-(t,\theta,q) = -S_0(q) - \mu M^-(t,\theta,q) + O(\mu^2),$$

where $S_0(q) = \frac{2\sqrt{\epsilon}}{\pi}(1 - \cos(\pi q))$ is the generating function of the unperturbed manifolds, and $M^\pm$ are the so-called Poincaré-Melnikov integrals,

$$M^+(t,\theta,q) = \epsilon\int_{-\infty}^{t} F\Big(s, \theta + a(s-t), \frac{2}{\pi}\arctan\big(e^{2\pi\sqrt{\epsilon}(s-t)}\tan(\pi q/2)\big)\Big)\,ds,$$

$$M^-(t,\theta,q) = \epsilon\int_{t}^{+\infty} F\Big(s, \theta + a(s-t), \frac{2}{\pi}\arctan\big(e^{-2\pi\sqrt{\epsilon}(s-t)}\tan(\pi q/2)\big)\Big)\,ds,$$



where $F(t,\theta,q) = (1 - \cos(2\pi q))f(t,\theta,q)$. To better understand these formulas, it is worth recalling that

$$s \longmapsto \frac{2}{\pi}\arctan\big(e^{2\pi\sqrt{\epsilon}(s-t)}\tan(\pi q/2)\big)$$

is the homoclinic orbit of the system $\|p\|^2/2 + \epsilon(\cos(2\pi q) - 1)$ which takes the value $q$ at time $t$. The formulas above are similar to (6), but the integration is performed on unperturbed trajectories, which are explicitly known. For $q \in ]1/4, 3/4[$, we obtain:

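$$A_a(t,\theta,q) = S_a^+(t,\theta,q) - S_a^-(t,\theta,q-1) = \mu M_a(t,\theta,q) + O(\mu^2),$$

where $M_a$ is the Poincaré-Melnikov integral

$$M_a(t,\theta,q) = M^+(t,\theta,q) + M^-(t,\theta,q-1) = \epsilon\int_{-\infty}^{+\infty} F\Big(s, \theta + a(s-t), \frac{2}{\pi}\arctan\big(e^{2\pi\sqrt{\epsilon}(s-t)}\tan(\pi q/2)\big)\Big)\,ds.$$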
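The explicit orbit entering these integrals can be checked numerically. Reading the formula as $s \mapsto \frac{2}{\pi}\arctan(e^{2\pi\sqrt{\epsilon}(s-t)}\tan(\pi q/2))$, it should pass through $q$ at time $t$, satisfy $\dot q = 2\sqrt{\epsilon}\sin(\pi q)$ (the zero energy level of the pendulum), and connect $q = 0$ to $q = 1$. The sketch below encodes these two ingredients; the function names are chosen for this example.

```python
import math


def homoclinic(s, t, q, eps):
    """Unperturbed separatrix orbit taking the value q (0 < q < 1) at time t."""
    growth = math.exp(2.0 * math.pi * math.sqrt(eps) * (s - t))
    return 2.0 / math.pi * math.atan(growth * math.tan(math.pi * q / 2.0))


def separatrix_velocity(q, eps):
    """dq/ds = p = 2*sqrt(eps)*sin(pi q) on the level p^2/2 + eps*(cos(2 pi q) - 1) = 0."""
    return 2.0 * math.sqrt(eps) * math.sin(math.pi * q)
```

A centered finite difference of `homoclinic` in `s` agrees with `separatrix_velocity` evaluated along the orbit, and the orbit tends to 0 and 1 at the rate $e^{-2\pi\sqrt{\epsilon}|s|}$, which is what makes the Melnikov integrals converge.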

In the specific case studied by Arnold, where $f(t,\theta,q) = \cos(2\pi\theta) + \cos(2\pi t)$, the Melnikov integral can be computed explicitly through residues; we obtain:

$$M_a(t,\theta,1/2) = \frac{a}{\operatorname{sh}(\pi a/2\sqrt{\epsilon})}\cos(2\pi\theta) + \frac{1}{\operatorname{sh}(\pi/2\sqrt{\epsilon})}\cos(2\pi t),$$

which has non-degenerate critical points, for instance at $(t,\theta) = (0,0)$. We can conclude, following Arnold:

Theorem 2 (Arnold, [1]). Let us consider the Hamiltonian (3) with $f(t,\theta,q) = \cos(2\pi\theta) + \cos(2\pi t)$ and $\mu > 0$ small enough. Given two real numbers $a^- < a^+$, there exists an orbit $(\theta(t), q(t), I(t), p(t))$ and a time $T > 0$ such that $I(0) \leq a^-$ and $I(T) \geq a^+$.
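The residue computation behind the explicit Melnikov integral can be cross-checked by quadrature. The sketch below assumes the definitions of this section as read here, namely $M_a(t,\theta,1/2) = \epsilon\int_{\mathbb{R}}(1 - \cos(2\pi q(s)))\,f(s,\theta + a(s-t))\,ds$ along the separatrix orbit through $q = 1/2$ at time $t$, and compares it with the closed form $\frac{a}{\operatorname{sh}(\pi a/2\sqrt{\epsilon})}\cos(2\pi\theta) + \frac{1}{\operatorname{sh}(\pi/2\sqrt{\epsilon})}\cos(2\pi t)$. The function names and the choice of the trapezoidal rule are specific to this example.

```python
import math


def melnikov_numeric(t, theta, a, eps, n=20001):
    """Trapezoidal approximation of M_a(t, theta, 1/2) for f = cos(2 pi theta) + cos(2 pi t)."""
    rate = 2.0 * math.pi * math.sqrt(eps)
    half_width = 40.0 / rate          # the integrand is below 1e-33 outside this window
    h = 2.0 * half_width / (n - 1)
    total = 0.0
    for i in range(n):
        s = t - half_width + i * h
        # Separatrix orbit through q = 1/2 at time t (tan(pi/4) = 1).
        q = 2.0 / math.pi * math.atan(math.exp(rate * (s - t)))
        f = math.cos(2.0 * math.pi * (theta + a * (s - t))) + math.cos(2.0 * math.pi * s)
        weight = 0.5 if i in (0, n - 1) else 1.0
        total += weight * (1.0 - math.cos(2.0 * math.pi * q)) * f
    return eps * h * total


def melnikov_closed_form(t, theta, a, eps):
    """a/sh(pi a/(2 sqrt(eps))) cos(2 pi theta) + 1/sh(pi/(2 sqrt(eps))) cos(2 pi t)."""
    r = math.sqrt(eps)
    # For a = 0 the first coefficient is replaced by its limit 2 sqrt(eps)/pi.
    coeff_theta = 2.0 * r / math.pi if a == 0.0 else a / math.sinh(math.pi * a / (2.0 * r))
    coeff_t = 1.0 / math.sinh(math.pi / (2.0 * r))
    return coeff_theta * math.cos(2.0 * math.pi * theta) + coeff_t * math.cos(2.0 * math.pi * t)
```

Since both coefficients are positive under this reading, the four critical points of $M_a(\cdot,\cdot,1/2)$ on $\mathbb{T}^2$ are all non-degenerate: a maximum at $(0,0)$, a minimum at $(1/2,1/2)$ and two saddles.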

2.4. Bessi's variational mechanism. Ugo Bessi introduced in [9] a very interesting approach to study the system (3), see also [10, 11]. In order to describe this approach, let us define the function

$$\mathcal{A}_a : \mathbb{R} \times \mathbb{T} \times ]1/4, 3/4[ \times \mathbb{R} \times \mathbb{T} \times ]1/4, 3/4[ \longrightarrow \mathbb{R}$$

$$((t_1,\theta_1,q_1),(t_2,\theta_2,q_2)) \longmapsto \min \int_{t_1}^{t_2} L(s,\theta(s),q(s),\dot\theta(s),\dot q(s)) - a\dot\theta(s) + a^2/2 \, ds,$$

where the minimum is taken on the set of $C^1$ curves $(\theta(s), q(s)) : [t_1,t_2] \longrightarrow \mathbb{T} \times \mathbb{R}$ such that

$$(\theta(t_1), q(t_1)) = (\theta_1, q_1 - 1) \quad \text{and} \quad (\theta(t_2), q(t_2)) = (\theta_2, q_2).$$

When the time interval $t_2 - t_1$ is very large, the minimizing trajectory in the definition of $\mathcal{A}_a$ roughly looks like the concatenation of an orbit positively asymptotic to $\mathcal{T}(a)$ followed by an orbit negatively asymptotic to $\mathcal{T}(a)$. Using this observation, and recalling the formula (6), it is possible to prove that

$$\mathcal{A}_a((t_1,\theta_1,q_1),(t_2 + k,\theta_2,q_2)) \longrightarrow S_a^+(t_2 \bmod 1,\theta_2,q_2) - S_a^-(t_1 \bmod 1,\theta_1,q_1 - 1)$$
when $k \to \infty$. Fixing the real numbers $a_0, a_1, \ldots, a_k$ and the integers $\tau_1, \ldots, \tau_k$, we consider the discrete action functional

$$\begin{aligned}
& S_{a_0}^+(t_1 \bmod 1, \theta_1 \bmod 1, 1/2) + (a_0 - a_1)\theta_1 \\
&+ \mathcal{A}_{a_1}((t_1, \theta_1 \bmod 1, 1/2), (t_2 + \tau_2, \theta_2 \bmod 1, 1/2)) + (a_1 - a_2)\theta_2 \\
&+ \mathcal{A}_{a_2}((t_2, \theta_2 \bmod 1, 1/2), (t_3 + \tau_3, \theta_3 \bmod 1, 1/2)) + (a_2 - a_3)\theta_3 \\
&+ \cdots \\
&+ \mathcal{A}_{a_{k-1}}((t_{k-1} + \tau_{k-1}, \theta_{k-1} \bmod 1, 1/2), (t_k, \theta_k \bmod 1, 1/2)) + (a_{k-1} - a_k)\theta_k \\
&- S_{a_k}^-(t_k \bmod 1, \theta_k \bmod 1, 1/2)
\end{aligned}$$

defined on $(]-1,1[ \times ]-1,1[)^k$. It is not hard to check that local minima of this discrete action functional give heteroclinics between the torus $\mathcal{T}(a_0)$ and the torus $\mathcal{T}(a_k)$. In order to prove that local minima exist, observe that this functional is approximated by

$$\Sigma_{a_0,a_1}(t_1 \bmod 1, \theta_1) + \cdots + \Sigma_{a_{k-1},a_k}(t_k \bmod 1, \theta_k)$$

when the integers $\tau_i$ are large enough, with the functions $\Sigma$ as defined in (7).
This limit functional has the remarkable structure that the variables $(t_i,\theta_i)$ are separated. This break-down of the action functional into a sum of independent functions is sometimes called an anti-integrable limit; it is related to the obstruction property of the invariant tori, to the $\lambda$-Lemma, and to Shilnikov's Lemma, see [] :]. The limit functional has an isolated local minimum provided each of the functions $\Sigma_{a_{i-1},a_i}$ has one, which is equivalent to saying that $a_0, a_1, \ldots, a_k$ is an elementary transition chain. In this case, the integers $\tau_i$ can be chosen large enough so that the action functional above has a local minimum, which gives a heteroclinic orbit between $\mathcal{T}(a_0)$ and $\mathcal{T}(a_k)$. Technically, this method has several advantages. In our presentation we introduced the generating functions $S_a^\pm$ of the invariant manifolds of the involved tori in order to stress the relations between the geometric and the variational method, and also because this works in a more general setting, see [ITi] for example. In our context and when $\mu > 0$ is small enough, it is easier to directly approximate the functions $\mathcal{A}_a$ in terms of the Melnikov integrals, and to use the following approximation for the action functional with large $\tau_i$ and small $\mu$ without the intermediate step through $S_a^\pm$:

$$\mu M_{a_0}(t_0 \bmod 1, \theta_0 \bmod 1, 1/2) + (a_1 - a_0)\theta_0 + \cdots + \mu M_{a_k}(t_k \bmod 1, \theta_k \bmod 1, 1/2) + (a_k - a_{k-1})\theta_k.$$

The corresponding calculations, performed in [ ], are much more elementary than those required to derive the expansions (8).

2.5. Remarks on estimates. We have up to this point carefully avoided discussing the subtle and important aspect of explicit estimates. In order to complete rigorously the proof of Theorem 2, we should prove the existence of a threshold $\mu_0(\epsilon)$ such that the Melnikov approximation holds, simultaneously for all $a$, when $0 < \mu < \mu_0(\epsilon)$. This can actually be done, with

$$\mu_0(\epsilon) = e^{-\frac{C}{\sqrt{\epsilon}}},$$

but it is not simple, since it requires a careful study of the expansions of the functions $S_a^\pm$ and of how the coefficients depend on $a$ and $\epsilon$. This is related to the so-called splitting problem, see [A7]. As we mentioned above, the approach of Bessi makes it possible to prove that Theorem 2 holds for $0 < \mu < \mu_0(\epsilon)$ without estimating the splitting.
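A heuristic way to see why an exponentially small threshold is natural (this is an illustration added here, not the argument of the cited works): for Arnold's perturbation, and assuming the coefficient of $\cos(2\pi t)$ in the Melnikov integral of Section 2.3 is $1/\operatorname{sh}(\pi/2\sqrt{\epsilon})$, this first-order term behaves like $2e^{-\pi/(2\sqrt{\epsilon})}$ as $\epsilon \to 0$. The $O(\mu^2)$ remainder has to be dominated by a quantity of this size. The sketch compares the exact coefficient with its asymptotic form; the function names are hypothetical.

```python
import math


def melnikov_time_amplitude(eps):
    """Coefficient 1/sh(pi/(2 sqrt(eps))) of cos(2 pi t) in the Melnikov integral."""
    return 1.0 / math.sinh(math.pi / (2.0 * math.sqrt(eps)))


def leading_order(eps):
    """Asymptotic form 2*exp(-pi/(2 sqrt(eps))) of the same coefficient for small eps."""
    return 2.0 * math.exp(-math.pi / (2.0 * math.sqrt(eps)))
```

The ratio of the two functions equals $1 - e^{-\pi/\sqrt{\epsilon}}$, so the asymptotic form is already accurate for moderate values of $\epsilon$.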

It is also important to give time estimates, that is to estimate the time needed for the variable $I$ to perform a large evolution. Once again, this is closely related to the splitting estimates, although these can be avoided by using the method of Bessi. One should distinguish two different problems. Either we fix $\epsilon$, and try to estimate the time as a function of $\mu$, or we take $\mu$ as a function of $\epsilon$, say $\mu = \mu_0(\epsilon)/2$, and try to estimate the time as a function of $\epsilon$.

The second problem is especially important, because it is relevant for the study 
of the a priori stable problem. Once again, Ugo Bessi obtained the first estimate, 

c 

Estimating the time on examples allows to test the optimality of Nekhoroshev 
exponents, see [39, 36, 49] for works in that direction, see also [Hi] concerning the 
question of time estimates. 

It is worth mentioning also that in the first problem, estimating the time as a function of $\mu$, the estimate is polynomial in $1/\mu$, and not exponential. This was first understood by Pierre Lochak, and proved by Bessi's method in [2], where the estimate $T = C/\mu^2$ is given, see also [23]. The optimal estimate is $T = C|\ln\mu|/\mu$, as was conjectured by Lochak in [35] and proved by Berti, Biasco and Bolle in [o], see also [7].

Returning to the question of the threshold of validity, let us discuss what happens when $\mu$ is increased above $\mu_0(\epsilon)$. The content of Section 2.3 on finding transversal homoclinics via the Poincaré-Melnikov approximation breaks down, but the geometric constructions of the earlier sections are still valid. Theorem 1 holds as long as the invariant tori $\mathcal{T}(a)$ remain partially hyperbolic, and their stable and unstable manifolds can be represented by generating functions like (5). Actually, the methods we are now going to expose even allow us to relax this last assumption. Being able to treat larger values of $\mu$ is especially important in view of the possible applicability to the a priori stable problem.

2.6. Higher dimensions. Let us now discuss the following immediate generalization in higher dimensions of Arnold's example:

$$H(t,\theta,q,I,p) = \tfrac12\|p\|^2 + \tfrac12\|I\|^2 - \epsilon V(q)(1 + \mu f(t,\theta,q))$$

with $(t,\theta,q,I,p) \in \mathbb{T} \times \mathbb{T}^m \times \mathbb{T}^r \times \mathbb{R}^m \times \mathbb{R}^r$, where $V(q)$ is a non-negative function having a unique non-degenerate minimum at $q = 0$, with $V(0) = 0$. The main difference with the example of Arnold appears for $r > 1$. In this case, the system is not integrable even for $\mu = 0$. There still exists a family of partially hyperbolic tori of dimension $m$,

$$\mathcal{T}(a) := \{(t, \theta, 0, a, 0) : (t,\theta) \in \mathbb{T} \times \mathbb{T}^m\}$$

parametrized by $a \in \mathbb{R}^m$, but the system $\|p\|^2/2 - \epsilon V(q)$ is not necessarily integrable any more. As a consequence we do not know explicitly the stable and unstable manifolds of the hyperbolic fixed point $(0,0)$, and so we do not have a perturbative setting to describe the stable and unstable manifolds of the hyperbolic tori $\mathcal{T}(a)$. This is also what happens for $r = 1$ if $\mu$ is not small enough. There is no obvious generalization of the generating functions $S_a^\pm$ in that setting, because the stable and unstable manifolds are not necessarily graphs over a prescribed domain. The proof of the existence of homoclinic orbits as given in 2.1 thus breaks down. The existence of homoclinic orbits in that setting can still be proved by global variational methods, as is now quite well understood, see [14, 28, 20, 3, 27] for example.

The proof is quite easy in our context, let us give a rapid sketch. We first define a function $\mathcal{A}_a$ similar to the one appearing in Section 2.4, but slightly different:

$$\mathcal{A}_a : \mathbb{R} \times \mathbb{T}^m \times \mathbb{T}^r \times \mathbb{R} \times \mathbb{T}^m \times \mathbb{T}^r \longrightarrow \mathbb{R}$$

$$((t_1,\theta_1,q_1),(t_2,\theta_2,q_2)) \longmapsto \min \int_{t_1}^{t_2} L(s,\theta(s),q(s),\dot\theta(s),\dot q(s)) - a\cdot\dot\theta(s) + a^2/2 \, ds,$$

where the minimum is taken on the set of curves $(\theta(s), q(s)) : [t_1,t_2] \longrightarrow \mathbb{T}^m \times \mathbb{T}^r$ such that $(\theta(t_i), q(t_i)) = (\theta_i, q_i)$ for $i = 1$ or $2$. Let us set

$$\xi(a) := \liminf_{\mathbb{N} \ni k \to \infty} \mathcal{A}_a((0,0,q_0),(k,0,q_1)),$$

and consider a sequence of minimizing extremals

$$(\theta_i(t), q_i(t)) : [0,k_i] \longrightarrow \mathbb{T}^m \times \mathbb{T}^r$$

such that $(\theta_i(0), q_i(0)) = (0, q_0)$, $(\theta_i(k_i), q_i(k_i)) = (0, q_1)$, $k_i \to \infty$, and

$$\int_0^{k_i} L(s,\theta_i(s),q_i(s),\dot\theta_i(s),\dot q_i(s)) - a\cdot\dot\theta_i(s) + a^2/2 \, ds \longrightarrow \xi(a).$$

Let $M$ be a submanifold of $\mathbb{T} \times \mathbb{T}^m \times \mathbb{T}^r$ which separates $\mathbb{T} \times \mathbb{T}^m \times \{q_0\}$ from $\mathbb{T} \times \mathbb{T}^m \times \{q_1\}$, let $T_i \in [0,k_i]$ be a time such that $(T_i \bmod 1, \theta_i(T_i), q_i(T_i)) \in M$, and let $\tau_i$ be the integer part of $T_i$. It is not hard to check that the curves $(\theta_i(t - \tau_i), q_i(t - \tau_i))$ converge (up to a subsequence) uniformly on compact sets to a limit $(\theta_\infty(t), q_\infty(t)) : \mathbb{R} \longrightarrow \mathbb{T}^m \times \mathbb{T}^r$. This limit curve satisfies

$$\int_{\mathbb{R}} L(s,\theta_\infty(s),q_\infty(s),\dot\theta_\infty(s),\dot q_\infty(s)) - a\cdot\dot\theta_\infty(s) + a^2/2 \, ds = \xi(a), \tag{9}$$

and the corresponding orbit is a heteroclinic from $\mathcal{T}_0(a)$ to $\mathcal{T}_1(a)$. We call minimizing heteroclinics (for the lifted system) those which have minimal action, or in other words those which satisfy (9). In the original system (before taking the covering), we call minimizing homoclinic orbit a homoclinic which lifts to a minimizing heteroclinic.

Let us now try to establish some connections between the present discussion and the proof of the existence of homoclinic orbits given in 2.1. We define two functions on $\mathbb{T} \times \mathbb{T}^m \times \mathbb{T}^r$:

$$S_a^-(t,\theta,q) = -\liminf_{\mathbb{N} \ni k \to \infty} \mathcal{A}_a((t,\theta,q),(k,0,q_1)) \tag{10}$$

$$S_a^+(t,\theta,q) = \liminf_{\mathbb{N} \ni k \to \infty} \mathcal{A}_a((0,0,q_0),(t+k,\theta,q)). \tag{11}$$

Note that

$$S_a^+(0,0,q_1) = -S_a^-(0,0,q_0) = \xi(a).$$

The functions $S_a^\pm$, whose definition is basic both in Mather's ([41]) and in Fathi's ([29]) theory, share many features with those introduced in (5), which is why we use the same name. Let us state some of their properties:

The function $S_a^-$ is non-positive and it vanishes on $\mathbb{T} \times \mathbb{T}^m \times \{q_1\}$ (and only there). Moreover, it is smooth around this manifold, which is a transversally non-degenerate critical manifold. Let us choose a small $\delta > 0$. The set

$$\mathcal{W}^-_{loc}(a) = \{(t,\theta,q,a + \partial_\theta S_a^-(t,\theta,q),\partial_q S_a^-(t,\theta,q)) : S_a^-(t,\theta,q) > -\delta\}$$

is a positively invariant local stable manifold of $\mathcal{T}_1(a)$.

Similarly, $S_a^+$ is non-negative, it vanishes on $\mathbb{T} \times \mathbb{T}^m \times \{q_0\}$, it is smooth around it, and this critical manifold is transversally non-degenerate. The set

$$\mathcal{W}^+_{loc}(a) = \{(t,\theta,q,a + \partial_\theta S_a^+(t,\theta,q),\partial_q S_a^+(t,\theta,q)) : S_a^+(t,\theta,q) < \delta\}$$

is a negatively invariant local unstable manifold of $\mathcal{T}_0(a)$.

The functions $S_a^\pm$ also have a global meaning. Let us give the details for $S_a^+$. For each point $(T, \Theta, Q)$, there exists a real number $\tau \in \mathbb{R}$ and at least one solution $(\theta(s),q(s)) : (-\infty,\tau] \longrightarrow \mathbb{T}^m \times \mathbb{T}^r$ of the Euler-Lagrange equations such that $(\tau \bmod 1, \theta(\tau), q(\tau)) = (T, \Theta, Q)$, and which is calibrated by $S_a^+$ in the following sense: the relation

$$S_a^+(t \bmod 1,\theta(t),q(t)) - S_a^+(s \bmod 1,\theta(s),q(s)) = \int_s^t L(\sigma,\theta(\sigma),q(\sigma),\dot\theta(\sigma),\dot q(\sigma)) - a\cdot\dot\theta(\sigma) + a^2/2 \, d\sigma$$

holds for all $s < t \leq \tau$. The corresponding orbit is asymptotic either to $\mathcal{T}_0(a)$ or to $\mathcal{T}_1(a)$ when $s \to -\infty$. It is not easy in general to determine whether the asymptotic torus is $\mathcal{T}_0(a)$ or $\mathcal{T}_1(a)$, but the following Lemma is not hard to prove:

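Lemma 6. If $S_a^+(T,\Theta,Q) < \xi(a)$, then each calibrated curve

$$(\theta(s),q(s)) : (-\infty,\tau] \longrightarrow \mathbb{T}^m \times \mathbb{T}^r$$

satisfying $(\tau \bmod 1,\theta(\tau),q(\tau)) = (T,\Theta,Q)$ is $\alpha$-asymptotic to $\mathcal{T}_0(a)$, and satisfies

$$\int_{-\infty}^{\tau} L(\sigma,\theta(\sigma),q(\sigma),\dot\theta(\sigma),\dot q(\sigma)) - a\cdot\dot\theta(\sigma) + a^2/2 \, d\sigma = S_a^+(T,\Theta,Q).$$

If the function $S_a^+$ is differentiable at $(T,\Theta,Q)$ then there is one and only one calibrated curve as above; it is characterized by the equations

$$\dot\theta(\tau) = a + \partial_\theta S_a^+(T,\Theta,Q), \qquad \dot q(\tau) = \partial_q S_a^+(T,\Theta,Q).$$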

Formally, the critical points of the difference $S_a^+ - S_a^-$ correspond to heteroclinic orbits (in the lifted system). By studying a bit more carefully the relations between the calibrated curves and the differentiability properties of the functions $S_a^\pm$ (which is one of the central aspects of Fathi's Weak KAM theory, see [29]), this idea can be made rigorous as follows:

Lemma 7. If $(T,\Theta,Q)$ is a local minimum of the function $S_a^+ - S_a^-$, then both $S_a^+$ and $S_a^-$ are differentiable at the point $(T,\Theta,Q)$, we have

$$(T,\Theta,Q,a + \partial_\theta S_a^-,\partial_q S_a^-) = (T,\Theta,Q,a + \partial_\theta S_a^+,\partial_q S_a^+),$$

and the orbit of this point is either a heteroclinic between $\mathcal{T}_0(a)$ and $\mathcal{T}_1(a)$ or a homoclinic to $\mathcal{T}_0(a)$ or to $\mathcal{T}_1(a)$ in the system lifted to the covering, and thus it projects to an orbit homoclinic to $\mathcal{T}(a)$ in the original system.

Although it is not obvious a priori that a local minimum of the function $S_a^+ - S_a^-$ exists away from $q = q_0$ and $q = q_1$, this follows from the existence of minimizing heteroclinics, that we already proved. More precisely, we have:

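• The minimal value of $S_a^+ - S_a^-$ is $\xi(a)$.

• The point $(T,\Theta,Q)$ is a global minimum of $S_a^+ - S_a^-$ if and only if either $Q \in \{q_0, q_1\}$ or the orbit of the point $(T,\Theta,Q,a + \partial_\theta S_a^-,\partial_q S_a^-) = (T,\Theta,Q,a + \partial_\theta S_a^+,\partial_q S_a^+)$ is a minimizing heteroclinic between $\mathcal{T}_0(a)$ and $\mathcal{T}_1(a)$.

• The set of minima of the function $S_a^+ - S_a^-$ properly contains $\mathbb{T} \times \mathbb{T}^m \times \{q_0\} \cup \mathbb{T} \times \mathbb{T}^m \times \{q_1\}$.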

As a consequence, the trajectory $(\theta(t),q(t),\dot\theta(t),\dot q(t))$ is a minimizing heteroclinic if and only if $(S_a^+ - S_a^-)(t \bmod 1,\theta(t),q(t)) = \xi(a)$ for each $t \in \mathbb{R}$ (and if $q(t)$ is not identically $q_0$ or $q_1$). This minimizing heteroclinic is called isolated if, for some $t \in \mathbb{R}$, the point $(\theta(t),q(t))$ is an isolated minimum of the function

$$(\theta,q) \longmapsto (S_a^+ - S_a^-)(t \bmod 1,\theta,q).$$

Now that we have proved that the stable and unstable manifolds of the torus $\mathcal{T}(a)$ necessarily intersect, let us suppose that there exists a compact and connected set $A \subset \mathbb{R}^m$ such that the intersection is transversal for $a \in A$. By a continuity argument as in Proposition 1, we conclude that any two points $a^-$ and $a^+$ in $A$ can be connected by a transition chain, that is a sequence $a_0 = a^-, a_1, \ldots, a_n = a^+$ such that the unstable manifold of $\mathcal{T}(a_{i-1})$ transversally intersects the stable manifold of $\mathcal{T}(a_i)$. We would like to deduce the existence of a transversal heteroclinic orbit between $\mathcal{T}(a^-)$ and $\mathcal{T}(a^+)$, but I do not know whether the higher codimensional analog of Cresson's transitivity Lemma 4 holds. However, the weaker obstruction property proved in [21, 31] is enough to imply the existence of orbits connecting any neighborhood of $\mathcal{T}(a^-)$ to any neighborhood of $\mathcal{T}(a^+)$. It is also possible to build shadowing orbits using a variational approach. We need the slightly different assumption that $A \subset \mathbb{R}^m$ is a compact connected set such that, for all $a \in A$, all the minimizing homoclinics of $\mathcal{T}(a)$ are isolated. For each $a^-$ and $a^+$ in $A$, it is then possible to construct by a variational method similar to Section 2.4 a heteroclinic orbit between $\mathcal{T}(a^-)$ and $\mathcal{T}(a^+)$.



3. The general a priori unstable case 

A very specific feature of all the examples studied so far is that the perturbation
preserves the partially hyperbolic invariant tori $\mathcal{T}(a)$, $a \in \mathbb{R}^m$. We now discuss the
general a priori unstable system (2).

3.1. The Large Gap Problem. Let us assume that $r = 1$ and try to
apply the method of Section 2. There is no explicit invariant torus any more, but
KAM methods can be applied to prove the existence of many partially hyperbolic 
tori. More precisely, there exists a diffeomorphism

$\omega_\mu : \mathbb{R}^m \longrightarrow \mathbb{R}^m$

close to the identity, such that an invariant quasiperiodic torus $\mathcal{T}_\mu(a)$ of frequency
$\omega_\mu(a)$ exists, and is close to $\mathcal{T}(a)$, provided the frequency $\omega_\mu(a)$ satisfies some
Diophantine condition. Moreover, for such values of $a$, the local stable and unstable
manifolds of $\mathcal{T}_\mu(a)$ can be generated by functions

$S^\pm_\mu(t,\theta,q) : \mathbb{T}\times\mathbb{T}^m\times[-3/4,3/4] \longrightarrow \mathbb{R},$

as earlier. So we have exactly the same picture as in Section 2, except that the
objects are defined only on a subset $A_\mu \subset \mathbb{R}^m$ of parameters. In order to reproduce
the mechanism of Section 2, we must find elementary transition chains $a_0, \ldots, a_n$
in $\mathbb{R}^m$, with the additional requirement that $a_i \in A_\mu$. It is necessary at this point
to describe the set $A_\mu$ in a bit more detail. Roughly, the KAM methods allow one to prove the
existence of the torus $\mathcal{T}_\mu(a)$ provided $a$ belongs to

$A_\mu = \{a : |k\cdot(1,\omega_\mu(a))| \geq \sqrt{\mu}\,\|k\|^{-\tau} \quad \forall k \in \mathbb{Z}^{m+1} - \{0\}\}$

for some constant $\tau \geq m+1$. This set $A_\mu$ is totally disconnected, hence it is not
possible to apply a continuity method like in Proposition 1 in order to prove the
existence of a transition chain. We must be more quantitative, which is possible
when $\mu$ is so small that the Poincaré-Melnikov approximation is valid. In that
regime, the difference $S^+_\mu - S^-_\mu$ is approximately $\mu M_a$, where $M_a$ has a non-degenerate critical point.
The conclusion of Lemma 3 can then be proved to hold under the more explicit
condition that $\|a' - a\| \leq C\mu$. In other words, the sequence $a_0, a_1, \ldots, a_n$ is an
elementary transition chain if $\|a_i - a_{i-1}\| \leq C\mu$. However, the gaps in $A_\mu$ have
a width of size $\sqrt{\mu} > C\mu$. As a consequence, for small $\mu$, it seems impossible to
build long transition chains, and the method fails. This is the Large Gap Problem,
see [•]•')]. Even if there are classes of examples where the method can be applied
because more tori exist in some regions of phase space, see [13, 19, 8] for example,
the generic case seems out of range.
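
To make the mismatch of scales explicit, here is a short worked comparison. It uses only the two orders of magnitude quoted above and treats the constant $C$ as fixed; the precise constants are not specified in this text, so this is an illustration rather than a sharp estimate.

```latex
\[
\frac{\text{gap width}}{\text{admissible step}}
\;\sim\;
\frac{\sqrt{\mu}}{C\mu}
\;=\;
\frac{1}{C\sqrt{\mu}}
\;\longrightarrow\; \infty
\quad (\mu \to 0),
\qquad
\sqrt{\mu} > C\mu \iff \mu < C^{-2}.
\]
```

In this reading, as soon as $\mu < C^{-2}$ a single elementary transition is too short to jump across a gap, and the ratio only gets worse as $\mu$ decreases.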

3.2. Normally hyperbolic invariant cylinder. The Large Gap Problem
has now been solved, at least in the case where $m = 1$, see [46, 17, 18, 26, 5, 44].
We will not discuss and compare all these solutions here, but just present some general
ideas which arise from them.

An important new point of view is to focus on the whole cylinder $\mathcal{C} = \cup_a \mathcal{T}(a)$
rather than on each of the tori $\mathcal{T}(a)$ individually. This cylinder is normally
hyperbolic in the sense of [34, 30], and thus it is preserved in the perturbed system.
This new point of view is very natural; it appears in [43, 24], and then in many
other papers. The deformed cylinder $\mathcal{C}_\mu$ contains all the preserved tori $\mathcal{T}_\mu(a)$
obtained by KAM theory. The restricted dynamics is described by an a priori stable
system on $\mathbb{T}\times\mathbb{T}^m\times\mathbb{R}^m$. If $m > 1$, we are confronted with our lack of understanding
of the a priori stable situation. If $m = 1$, however, the restricted system is the
suspension of an area-preserving twist map, and we can exploit the good
understanding of these systems given by Birkhoff theory, which has also been interpreted
(and extended) variationally in the works of Mather [40, 41]. We consider this
case ($m = 1$) from now on. The invariant 2-tori which are graphs are of particular
importance (they correspond to rotational invariant circles of the time-one map).
To each of these invariant graphs, we can associate two real numbers: the rotation
number $\omega$ (defined from the Poincaré theory of circle homeomorphisms), and the area
$a$, which is the symplectic area of the domain of the cylinder $\mathcal{C}_\mu \cap \{t = 0\}$
delimited by the zero section and by the invariant graph under consideration. If a given
invariant graph $\mathcal{T}$ of the restricted dynamics has irrational rotation number (or is
completely periodic), then there is no other invariant graph with the same area
$a$. We can take a two-covering and associate to this graph two functions $S^\pm$ by
a formula similar to (10). They generate the local stable and unstable manifolds of
the torus $\mathcal{T}$, which corresponds to the global minima of the so-called
barrier function $S^+ - S^-$. Minimal homoclinics and isolated minimal homoclinics
to $\mathcal{T}$ can be defined as in Section 2.6. The existence of minimal homoclinics can
be proved basically in the same way as it was there.
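
For concreteness, the area attached to an invariant graph can be written as an integral. This is only a sketch under an assumption not made in the text: that the section $\mathcal{C}_\mu \cap \{t = 0\}$ carries coordinates $(\theta, I)$ in which the induced area form is $d\theta \wedge dI$ and the zero section is $\{I = 0\}$. The functions $g$, $g_1$, $g_2$ are hypothetical names for the graphs.

```latex
\[
a(g) \;=\; \int_{\mathbb{T}} g(\theta)\, d\theta ,
\qquad
a(g_2) - a(g_1) \;=\; \int_{\mathbb{T}} \bigl(g_2(\theta) - g_1(\theta)\bigr)\, d\theta .
\]
```

Here $a(g)$ is the signed area between the zero section and the graph $\{I = g(\theta)\}$, and the second identity gives the signed area enclosed between two invariant graphs.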

Definition 8. An invariant graph is called a transition torus if it has irrational
rotation number (or if it is foliated by periodic orbits), and if all its minimal
homoclinic orbits are isolated.

Transition tori can be used to build transition chains in the same way as
partially hyperbolic quasiperiodic tori with transversal homoclinics. Let $A \subset \mathbb{R}$ be
the set of areas of transition tori. To each $a \in A$ is attached a unique transition
torus $\mathcal{T}_\mu(a)$ (note that this torus may be only Lipschitz, and is not necessarily
quasiperiodic). If $A$ contains an interval $[a^-,a^+]$, then the existence of a
heteroclinic orbit between $\mathcal{T}_\mu(a^-)$ and $\mathcal{T}_\mu(a^+)$ can be proved by the methods already described
(considering the way we have chosen our definitions, a variational method should
be used, but a parallel geometric theory could certainly be given).

In general, the set $A$ is totally disconnected, and transition chains can't be
obtained by a simple continuity method. If we make the additional hypothesis
that all invariant graphs of the restricted dynamics are transition tori, then the set
$A$ is closed and a connected component $]a^-,a^+[$ of its complement corresponds to a
"region of instability" of the restricted system in the terminology of Birkhoff. More
precisely, the tori $\mathcal{T}_\mu(a^-)$ and $\mathcal{T}_\mu(a^+)$ enclose a cylinder which does not contain any
invariant graph. The theory of Birkhoff then implies that there exist orbits of the
restricted dynamics connecting an arbitrarily small neighborhood of $\mathcal{T}_\mu(a^-)$ to an
arbitrarily small neighborhood of $\mathcal{T}_\mu(a^+)$. This gives an indication about how to
solve the Large Gap Problem: use the Birkhoff orbits to cross regions of instability,
and the Arnold homoclinic mechanism to cross transition circles. It is by no means
obvious how to prove the existence of actual orbits shadowing that kind of structure. In
order to do so, one should first put these mechanisms into a common framework.
The variational framework seems appropriate, although a geometric approach is
also possible. The Birkhoff theory was described and extended using variational
methods by Mather in [40], and he proposed a new variational formalism adapted to
higher dimensional situations in [41]. On the other hand, Bessi's method indicates
how to put Arnold's mechanism into a variational framework. These heuristics
lead to:

Theorem 3. Let $[a^-,a^+]$ be a given interval. If all the invariant graphs of area
$a \in [a^-,a^+]$ of the restricted dynamics are transition tori, then there exists an
orbit $(\theta(t),q(t),\dot\theta(t),\dot q(t))$ and a time $T > 0$ such that $\dot\theta(0) \leq a^-$ and $\dot\theta(T) \geq a^+$.

This theorem is proved using variational methods and weak KAM theory in
[5], Section 11, where it is deduced from more general abstract results. It also
almost follows from [ ], Theorem 5.1, which is another general abstract result
proved by elaborations on Mather's variational methods [41], see also [4]. Applying
that result of Cheng and Yan, however, would require a minor additional generic
hypothesis on the restricted dynamics. In the case where $r = 1$, a slightly weaker
version of Theorem 3 could also be deduced from the earlier paper of Cheng and
Yan [17]. Under different sets of hypotheses, results in the same spirit have been
obtained by geometric methods in [32, 33]. At the moment, these methods do not
reach statements as general as Theorem 3, but they apply in contexts where the
variational methods can't be used.

The following variant of Theorem 3 may deserve attention in connection to
the Arnold mechanism: Assume that $a^-$ and $a^+$ belong to $A$, or in other words
that there exist transition tori $\mathcal{T}_\mu(a^\pm)$. These tori enclose a compact invariant
piece $\mathcal{C}_\mu[a^-,a^+]$ of the invariant cylinder. If all the invariant graphs contained in
$\mathcal{C}_\mu[a^-,a^+]$ are transition tori, then we say that $\mathcal{C}_\mu[a^-,a^+]$ is a transition channel.
The proof of Theorem 3 also implies that, if $\mathcal{C}_\mu[a^-,a^+]$ is a transition channel,
then there exists a heteroclinic orbit connecting $\mathcal{T}_\mu(a^-)$ to $\mathcal{T}_\mu(a^+)$.

Theorem 3 proves the existence of diffusion under "explicit" conditions. These
conditions are hard to check on a given system, but they seem to hold for typical 
systems. It is much harder than one may expect to prove a precise statement in that 
direction, but it was achieved by Cheng and Yan in [17, 18]. The main difficulty 
comes from the condition on the isolated minimal homoclinics. Actually, it is 
not hard to prove that the homoclinics to a given torus are isolated for a typical 
perturbation, but we need the condition to hold for all the tori simultaneously. 
Since there are uncountably many tori, it is necessary to understand the regularity
of the map $a \longmapsto S^\pm_a$. Recall that the functions $S^\pm_a$ are well-defined provided
there exists an invariant graph of area $a$ which has irrational rotation number or
is foliated by periodic orbits. We call $\tilde{A}$ this set of areas; it contains $A$. Cheng
and Yan prove that the map $a \longmapsto S^\pm_a$ is Hölder continuous on $\tilde{A}$, and deduce the
genericity result using an unpublished idea of John Mather.



4. Back to the a priori stable case 

The main objects in Arnold's mechanism are partially hyperbolic tori, that he 
called whiskered tori. It was proved by Treshchev [->m], that whiskered tori exist 
in the a priori stable situation, see also [27, 4-'3]. However, because of the Large 
Gap Problem, it seems difficult to prove directly the existence of transition chains 
made of whiskered tori. Actually, small transition chains do exist, because the 
density of KAM tori increases near a given one, but the length of these chains gets 
small when $\varepsilon$ gets small, hence these chains do not produce instability of the action
variables in general. 

The modern paradigm on the a priori unstable case that we presented in Section
3.2 singles out 3-dimensional normally hyperbolic invariant cylinders as the important
structure. It is well-known that normally hyperbolic invariant cylinders exist in
the a priori stable case. For example, each 2-dimensional whiskered torus has a
center manifold, which is a 3-dimensional normally hyperbolic invariant cylinder,
see e.g. [12]. Actually, it is simpler to prove directly the existence of normally
hyperbolic invariant cylinders; this involves no small divisors. However, the most
direct proofs seem to produce "small" normally hyperbolic cylinders, which means
that their size gets small with $\varepsilon$, so that we face the same problem as above
when we had small transition chains. The main statement of [ ] is that "large"
normally hyperbolic cylinders exist, meaning that their size is bounded from below
independently of $\varepsilon$.

In order to be more specific, let us select a resonant momentum of the form
$y_0 = (I_0, 0) \in \mathbb{R}\times\mathbb{R}^{n-1}$, with $I_0$ Diophantine. Assuming that the corresponding
averaged potential $V$ has a unique minimum and that this minimum is
non-degenerate, we have:

Theorem 4 ([ ]). There exist two intervals $[a^-,a^+] \subset J$, $J$ open, both independent
from $\varepsilon$, and $\varepsilon_0 > 0$ such that, for $\varepsilon \in ]0,\varepsilon_0[$, the following holds:
There exists a C^ map

$(Q,P) : \mathbb{T}\times\mathbb{T}\times J \ni (t,\theta,I) \longmapsto (Q(t,\theta,I),P(t,\theta,I)) \in \mathbb{T}^{n-1}\times\mathbb{R}^{n-1}$

such that the flow is tangent to the graph $\Gamma$ of $(Q,P)$. Moreover, there exist two real
numbers $a_0 < a^-$ and $a_1 > a^+$ in $J$ (which depend on $\varepsilon$) such that the Treshchev
tori $\mathcal{T}(a_0)$ and $\mathcal{T}(a_1)$ exist and are contained in $\Gamma$. The part $\Gamma_0^1$ of $\Gamma$ delimited by
these two tori is then a compact invariant manifold with boundary of the flow; it
is normally hyperbolic. It is equivalent to say that it is partially hyperbolic with
a central distribution equal to the tangent space of $\Gamma$. The inner dynamics is the
suspension of an area-preserving twist map (where the area is the one induced from
the ambient symplectic form).

It is then reasonable to expect that, under generic additional hypotheses, $\Gamma_0^1$
is a transition channel as defined in Section 3.2, and thus that $\mathcal{T}(a_0)$ and $\mathcal{T}(a_1)$
are connected by a heteroclinic orbit. We are currently exploring that program in
collaboration with Vadim Kaloshin and Ke Zhang. It is important to observe that
the map $(Q,P)$ is not C^-close to $(0,0)$, and that the inner dynamics is not close
to integrable. Fortunately, Theorem 3 allows such a generality.